Description

[METHOD OF MOTION DETECTION FOR 
3D COMB FILTER VIDEO DECODER] 

Cross Reference to Related Applications 

[0001] This application claims the priority benefit of Taiwan
application serial no. 92136373, filed on Dec. 22, 2003.
Background of Invention 

[0002] Field of Invention 

[0003] The present invention relates to a method of motion
detection. More particularly, the present invention relates to
a method of motion detection for a 3-dimensional (3D) comb
filter video decoder of the NTSC type.

[0004] Description of Related Art 

[0005] In modern daily life, people can see various events
without stepping out of their homes. For example, people can
watch landscape scenes, news, or entertainment programs that
are transmitted from a TV station to a TV set. Furthermore,
with a monitoring system for the local community, people at
home can watch the images taken by a monitoring camera, so as
to know the situation outside. The foregoing video systems
each have their own function and purpose. In every case,
however, the video signals have to be transmitted from the
transmitting side to the receiving side.
[0006] Color is basically composed of the three primary colors
red (R), green (G), and blue (B). Accordingly, when the
transmitting side intends to transmit a video image, the color
information for R, G, and B is converted into electric signals,
and the electric signals are transmitted. However, because the
transmission bandwidth is limited, the R, G, and B color
information is transformed into luminance (luma) and chroma
information in order to save bandwidth. The Y(luma)UV(chroma)
color space is one example of such a transformation. The
relation between the RGB color space and the YUV color space is
Y = 0.299R + 0.587G + 0.114B; U = 0.493(B - Y); V = 0.877(R - Y).
In the Y equation, the coefficients for R, G, and B reflect the
sensitivity of the human eye to the three primary colors. U and
V represent blue and red after the luma is removed. For white
light with R = G = B, U and V are both zero, which represents
no color.
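
The conversion in paragraph [0006] can be checked with a short
sketch. The function name and the normalization of R, G, and B
to the range 0..1 are assumptions made for illustration; only
the coefficients come from the text.

```python
def rgb_to_yuv(r, g, b):
    """Convert RGB components to (Y, U, V) with the coefficients quoted above."""
    y = 0.299 * r + 0.587 * g + 0.114 * b
    u = 0.493 * (b - y)   # blue with the luma removed
    v = 0.877 * (r - y)   # red with the luma removed
    return y, u, v


# White light (R = G = B) carries luma only: U and V are zero up to rounding.
white_y, white_u, white_v = rgb_to_yuv(1.0, 1.0, 1.0)
```

Because the three luma coefficients sum to 1, any gray level
with R = G = B gives Y equal to that level and U = V = 0.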

[0007] During signal transmission, the chroma data is modulated
onto a subcarrier signal and mixed with the luma data. Under
the standard made by the National Television System Committee
(NTSC), the Y, U, and V information is modulated into a
composite video signal of Y + U sin(ωt) + V cos(ωt) and then
transmitted, wherein ω = 2π(Fsc), and Fsc is the subcarrier
frequency.
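
As an illustration of the modulation formula in paragraph
[0007], the following sketch evaluates the composite value at a
given time. The function name, the test values of Y, U, and V,
and the numeric subcarrier frequency are assumptions; the text
itself only names the frequency Fsc.

```python
import math

FSC_HZ = 3_579_545.0  # assumed NTSC subcarrier frequency; the text only calls it Fsc


def composite(y, u, v, t, fsc=FSC_HZ):
    """Composite value Y + U*sin(w*t) + V*cos(w*t) at time t, with w = 2*pi*Fsc."""
    w = 2.0 * math.pi * fsc
    return y + u * math.sin(w * t) + v * math.cos(w * t)


# Evaluate at four instants spaced a quarter of a subcarrier period apart.
y, u, v = 0.5, 0.1, 0.2
samples = [composite(y, u, v, k / (4.0 * FSC_HZ)) for k in range(4)]
```

At these four instants the subcarrier phase is 0, 0.5π, π, and
1.5π, so the values reduce to Y+V, Y+U, Y-V, and Y-U up to
rounding.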

[0008] After the composite video signal is received at the
receiver, the signal is sampled. A comb filter usually samples
the composite video signal at a frequency of four times Fsc. As
a result, each NTSC horizontal line has 910 sampling points,
and an NTSC frame has 525 horizontal lines. The total number of
sampling points per frame is 910*525 = 477750. Since the number
of sampling points for the whole frame is not an integer
multiple of 4, a phase difference occurs between the samples at
the same location in different frames.
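
The arithmetic behind this phase difference can be written out
as a brief sketch. The variable names are illustrative; the
figures of 910 samples per line and 525 lines per frame come
from the text, and each sample is taken to advance the
subcarrier phase by 90 degrees because the sampling rate is
four times Fsc.

```python
SAMPLES_PER_LINE = 910
LINES_PER_FRAME = 525

samples_per_frame = SAMPLES_PER_LINE * LINES_PER_FRAME  # 477750
remainder = samples_per_frame % 4                        # samples left over after whole cycles
phase_shift_deg = remainder * 90                         # each sample is 90 degrees of phase
```

The remainder is 2, so the same position in two adjacent frames
is sampled 180 degrees apart.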

[0009] In general, the most difficult part of TV decoder
technology is the separation of the luma signal and the chroma
signal. The quality of the luma/chroma separation affects the
decoding quality of the decoder. For this reason, current
applications that require high image quality adopt the 3D comb
filter technology to separate luma and chroma.

[0010] When the composite video signal is decoded by the 3D comb
filter, the composite video signal is sampled at every 90
degrees of the phase angle. Taking NTSC as the example, when
the sampling phases are 0, 0.5π, π, and 1.5π, the quantities
Y+V, Y+U, Y-V, and Y-U are respectively obtained. FIG. 1 is a
drawing schematically illustrating a portion of the sampling
result in a frame for the NTSC system. In FIG. 1, the vertical
axis is the position x of the horizontal line in the frame, and
the horizontal axis is the position y of the pixel in the
horizontal line. When two sampling data correspond to the same
position in adjacent frames, the two sampling points are
separated by 477750 sampling points (a multiple of 4 plus a
remainder of 2), so the two phases differ by exactly 180
degrees. The foregoing sampling relation for adjacent frames
can also be shown by FIG. 1 if the vertical axis is treated as
the frame sequence m, that is, as the time axis.
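
The sampling pattern described in paragraph [0010] can be
sketched as a lookup from frame, line, and pixel indices to the
sampled quantity. This is a simplified model under stated
assumptions: indices are zero-based, samples are counted
continuously across lines and frames, and sample (0, 0, 0) is
taken at phase 0. The function name is hypothetical.

```python
LABELS = ("Y+V", "Y+U", "Y-V", "Y-U")  # phases 0, 0.5*pi, pi, 1.5*pi


def sample_label(m, x, y, per_line=910, lines=525):
    """Quantity sampled at pixel y of line x in frame m (zero-based, assumed origin)."""
    index = (m * lines + x) * per_line + y   # running sample count
    return LABELS[index % 4]                 # four samples per subcarrier cycle


# The same position alternates between Y+U and Y-U from frame to frame.
labels_over_frames = [sample_label(m, 0, 1) for m in range(4)]
```

Under these assumptions a given position returns to the same
quantity every second frame, which is why samples two frames
apart can be subtracted directly.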

[0011] FIG. 2A is a block diagram illustrating the conventional
3D comb filter. In FIG. 2A, the 3D comb filter usually includes
an inter-frame Y/C separator 210, an intra-field Y/C separator
220 (that is, a 2D comb filter), a motion detector 230, a
memory 240, and a mixer 250. The composite video signal 201 is
obtained by sampling, wherein F_{m+1} represents the composite
video signal 201 for the (m+1)th frame. The memory 240 stores
the composite video signal 201 and provides the composite video
signal 202 and the composite video signal 205, wherein F_{m}
represents the composite video signal for the mth frame. The
intra-field Y/C separator 220 receives the composite video
signal 205, uses the spatial relation between the pixels in the
frame F_{m} to perform the Y/C separation, and then exports the
separated video signal 221.
[0012] Usually, the separation for a motion video signal is done
by the intra-field Y/C separator 220. However, when the
intra-field Y/C separator 220 processes a still video signal,
it causes the drawback of blurred edges. In order to improve
the image quality, the still video signal is conventionally
processed by the inter-frame Y/C separator 210. The inter-frame
Y/C separator 210 receives the composite video signals for the
frames F_{m+1} and F_{m} at the same time, uses the temporal
relation between the corresponding pixels in the adjacent
frames F_{m+1} and F_{m} for Y/C separation, and then exports
the separated video signal 211. The motion detector 230 is used
to judge whether the composite video signal 201 is in a motion
state or a still state. The conventional motion detector 230
receives the composite video signal 201 and the luminance data
221a that is provided by the separated video signal 221. The
luminance data 221a and the composite video signal 201 are used
to calculate the differences of luma and chroma between two
frames, which are used to judge whether the pixel is in motion
or still, and then a selection signal 231 is exported. The
mixer 250, according to the selection signal 231, selects the
separated video signal 221, the separated video signal 211, or
a mix of the two at a predetermined ratio, and then exports the
separated video signal 251.
[0013] The motion detector 230 is the essential part of the 3D
comb filter. If a motion state is mistakenly treated as a still
state, an obvious error appears in the image. However, if most
judgments are conservatively treated as the motion state, the
benefit of the 3D comb filter is reduced. The conventional
method for detecting motion is to calculate the Y/C values for
the previous frame and the current frame separately and to
compare the difference. FIG. 2B is a block diagram illustrating
the conventional motion detector for the 3D comb filter. In
FIG. 2B, for NTSC, after the composite video signal 201 goes
through the low pass filter (LPF) 260, data approximating the
luma data 232 is obtained. The frame buffer 291 delays the data
by one frame, so that the luma data 233 for the previous frame
is obtained. The current luma data 232 is compared with the
previous luma data 233 to obtain the luma difference 234. In
addition, the composite video signal 201 goes through the band
pass filter (BPF) 270, and the luma data 221a, which is
provided by the separated video signal calculated by the
intra-field Y/C separator 220, is subtracted, so as to obtain
the chroma data 236. The frame buffers 292 and 293 delay the
data by two frames, so as to obtain the chroma data 238 for the
frame two frames earlier. The chroma data 238 is subtracted
from the chroma data 236, so as to obtain the chroma difference
239. After the detection circuit 280 obtains the luma
difference 234 and the chroma difference 239, the larger of the
two is treated as the motion factor.



[0014] Conventionally, when the composite video signal 201 is
judged for the motion/still state, the motion factor is
compared with a predetermined threshold value. If the motion
factor is clearly greater than the threshold value, the signal
is judged to be in a motion state, and the detection circuit
280 exports the selection signal 231 to use the intra-field Y/C
separator 220. If the motion factor is clearly less than the
threshold value, the signal is judged to be in a still state,
and the detection circuit 280 exports the selection signal 231
to use the inter-frame Y/C separator 210, which improves the
image quality. If the motion factor is around the threshold
value, it is improper to decide between the motion state and
the still state. Usually, the luma data and the chroma data
respectively calculated by the intra-field Y/C separator 220
and the inter-frame Y/C separator 210 are mixed at a proper
ratio to handle this ambiguous situation. Therefore, the less
reliably the motion factor converges, the larger the ambiguous
region becomes, and the benefit of the 3D comb filter is
greatly reduced.
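
The three-way decision of paragraph [0014] might be sketched as
follows. The text does not state how wide the ambiguous region
is or how the mixing ratio is chosen, so the `margin` parameter
and the linear ramp are purely hypothetical, as are the
function names.

```python
def mix_ratio(motion_factor, threshold, margin):
    """Weight of the intra-field (2D) output: 1.0 for motion, 0.0 for still.

    `margin` (assumed, must be > 0) is the half-width of the ambiguous band
    around the threshold; inside the band the weight ramps linearly.
    """
    low, high = threshold - margin, threshold + margin
    if motion_factor >= high:
        return 1.0                      # clearly motion: use the 2D result
    if motion_factor <= low:
        return 0.0                      # clearly still: use the inter-frame result
    return (motion_factor - low) / (high - low)


def mix(sample_2d, sample_3d, ratio):
    """Blend the two separator outputs according to the ratio."""
    return ratio * sample_2d + (1.0 - ratio) * sample_3d
```

The wider the ambiguous band has to be made, the more often the
output falls back on a blend instead of the pure inter-frame
result, which is the loss of benefit the paragraph describes.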

[0015] In the conventional method for detecting motion, the Y/C
data is first calculated by a simple 2D Y/C separation, and the
Y/C data for the current frame is then compared with that of
the previous frame. According to the comparison result, the Y/C
data to be exported is determined. This is a chicken-and-egg
dilemma. If the 2D Y/C separation could precisely separate the
luma data and the chroma data at the beginning, the 3D comb
filter would not be necessary, and the motion factor would not
need to be calculated. However, if the luma data and the chroma
data calculated at the beginning contain errors, the motion
factor calculated from them also contains errors. The precision
becomes much poorer when the erroneous motion factor is used to
determine the final luma data and chroma data.
Summary of Invention 

[0016] The invention provides a method of motion detection for a
3D comb filter video decoder in an NTSC system. The original
composite video signal is used as the reference for motion
detection.

[0017] The invention provides a method of motion detection for a
3D comb filter video decoder in an NTSC system. The method
first samples the composite video signal to obtain and store a
plurality of sampling data F_{m}P_{x,y}, wherein F_{m}P_{x,y}
represents the sampling data of the composite video signal in
the mth frame at the xth line and the yth pixel, in which m, x,
and y are integers greater than 0. The data F_{m+1}P_{x,y},
F_{m}P_{x,y}, F_{m-1}P_{x,y}, and F_{m-2}P_{x,y} are used to
judge whether the composite video signal is in a motion state
or a still state.

[0018] According to the method of motion detection for a 3D comb
filter video decoder of the invention, the process of judging
whether the composite video signal is in a motion state or a
still state includes using F_{m+1}P_{x,y}, F_{m}P_{x,y},
F_{m-1}P_{x,y}, and F_{m-2}P_{x,y} to calculate a number of
maximum differences MD_{x,y}, in which MD_{x,y} represents the
maximum difference for the point at the xth line and the yth
pixel. After that, the maximum differences of any four adjacent
pixels are taken and averaged. A number of motion factors
MF_{x,y} are thereby obtained, and the MF_{x,y} is examined to
determine the motion state or the still state of the composite
video signal at the xth line and the yth pixel.
[0019] According to the method of motion detection for a 3D comb
filter video decoder of the invention, the process of sampling
the composite video signal uses a sampling frequency that is
four times the frequency of the subcarrier signal in the
composite video signal. In addition, the subcarrier signal is
sampled at the phases of 0, 0.5π, π, and 1.5π. The formula for
calculating the MD_{x,y} is MD_{x,y} = Max{ |F_{m}P_{x,y} -
F_{m-2}P_{x,y}|, |F_{m+1}P_{x,y} - F_{m-1}P_{x,y}| }.
[0020] According to the method of motion detection for a 3D comb
filter video decoder of the invention, the MF_{x,y} is obtained
by first selecting the maximum differences of any four adjacent
pixels including the MD_{x,y} and taking their average, so as
to obtain a number of averaged maximum differences AMD_{x,h},
wherein AMD_{x,h} represents the averaged maximum difference
for the hth pixel in the xth line and h is an integer. The
calculation formula is AMD_{x,h} = (MD_{x,h} + MD_{x,h+1} +
MD_{x,h+2} + MD_{x,h+3}) / 4. A minimum is obtained from a
number of adjacent averaged maximum differences, and a motion
factor MF_{x,y} is obtained, wherein MF_{x,y} represents the
motion factor for the yth pixel in the xth line. The
calculation formula is, for example, MF_{x,y} = Min(AMD_{x,y},
AMD_{x,y-1}, AMD_{x,y-2}, AMD_{x,y-3}), or MF_{x,y} =
Min(AMD_{x,y}, AMD_{x,y-3}), and so on. These are all within
the scope of the invention.

[0021] According to the method of motion detection of the
invention, the process of examining the motion factor MF_{x,y}
to judge whether the composite video signal is in the motion
state or the still state for the yth pixel in the xth line
comprises providing a threshold value. The MF_{x,y} is then
compared with the threshold value, wherein the yth pixel in the
xth line of the composite video signal is judged to be in the
motion state when the MF_{x,y} is greater than the threshold
value, and is otherwise judged to be in the still state. Here,
the MF_{x,y} is the motion factor for the mth frame.

[0022] In the invention, the motion detection is based on the
composite video signal without separation of luma and chroma.
As a result, motion can be judged precisely, and the advantages
of the 3D comb filter video decoder can be fully realized.

[0023] It is to be understood that both the foregoing general
description and the following detailed description are
exemplary, and are intended to provide further explanation of
the invention as claimed.
Brief Description of Drawings 

[0024] The accompanying drawings are included to provide a
further understanding of the invention, and are incorporated in
and constitute a part of this specification. The drawings
illustrate embodiments of the invention and, together with the
description, serve to explain the principles of the invention.



[0025] FIG. 1 is a drawing, schematically illustrating a portion of 
sampling result in the frame for the NTSC system. 

[0026] FIG. 2A is a block diagram, schematically illustrating the 
conventional 3D comb filter. 

[0027] FIG. 2B is a block diagram, illustrating the conventional 
motion detector for the 3D comb filter. 

[0028] FIG. 3 is a process diagram, schematically illustrating the
method of motion detection for a 3D comb filter video decoder
under an NTSC system, according to an embodiment of the
invention.

[0029] FIG. 4 is a block diagram, schematically illustrating the
application of the method of motion detection for a 3D comb
filter video decoder under an NTSC system, according to an
embodiment of the invention.
Detailed Description 

[0030] In the invention, a novel approach for robust motion
detection is proposed. FIG. 3 is a process diagram
schematically illustrating the method of motion detection for a
3D comb filter video decoder under an NTSC system, according to
an embodiment of the invention. Referring to FIG. 1 and FIG. 3,
in step S301, the composite video signal is sampled to obtain
multiple sampling data F_{m}P_{x,y}, each of which represents
the yth pixel of the xth line in the mth frame, in which m, x,
and y are positive integers. In the embodiment under NTSC, the
sampling frequency is, for example, four times the frequency of
the subcarrier signal in the composite video signal, so that
the subcarrier signal is sampled at the phases of 0, 0.5π, π,
and 1.5π.
[0031] In step S302, the maximum difference MD_{x,y} is calculated
by MD_{x,y} = Max{ |F_{m}P_{x,y} - F_{m-2}P_{x,y}|,
|F_{m+1}P_{x,y} - F_{m-1}P_{x,y}| }, wherein MD_{x,y} is the
maximum difference for the yth pixel of the xth line. In the
embodiment, the MD_{x,y} is, for example, the maximum
difference for the mth frame. The NTSC system shown in FIG. 1,
with the vertical axis for the frame and the horizontal axis
for the pixel, is taken as the example. The yth pixels of the
frame m and of the frame m-2, two frames earlier, have the
quantity Y+U, whereas the yth pixels of the previous frame m-1
and of the following frame m+1 have the quantity Y-U. A
subtraction is performed between Y+U and Y+U, and the absolute
value of the result is taken. Likewise, a subtraction is
performed between Y-U and Y-U, and the absolute value of the
result is taken. The larger of the two absolute values is the
MD_{x,y}.
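
Step S302 can be illustrated with a minimal sketch. The function
names and the representation of each line as a Python list of
sample values are assumptions for illustration; the formula is
the one given in the text.

```python
def max_difference(f_next, f_cur, f_prev, f_prev2):
    """MD for one pixel from the samples of frames m+1, m, m-1 and m-2.

    Frames m and m-2 share one subcarrier phase, and frames m+1 and m-1
    share the opposite phase, so each subtraction compares like with like.
    """
    return max(abs(f_cur - f_prev2), abs(f_next - f_prev))


def max_difference_line(line_next, line_cur, line_prev, line_prev2):
    """MD for every pixel of one line, given the same line from four frames."""
    return [max_difference(n, c, p1, p2)
            for n, c, p1, p2 in zip(line_next, line_cur, line_prev, line_prev2)]
```

For example, samples of 10, 20, 13, and 18 for frames m+1, m,
m-1, and m-2 give differences of 2 and 3, so the maximum
difference is 3.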
[0032] However, it is insufficient to use only the maximum
difference MD_{x,y} of each pixel as the motion factor, because
each sampling point represents a different quantity. Assume
that every pixel of the frame m+1 is red and that the pixels of
the other frames are white. The luma Y of white is much larger
than the luma Y of red, the chroma V of red is much larger than
the chroma V of white, and the chroma U of the two differs only
slightly. In this situation, if Y+V for white is compared with
Y+V for red, the difference is small. However, if Y-V for white
is compared with Y-V for red, the difference is exceptionally
large. In addition, when Y+U and Y-U are compared, the
difference is small. In other words, when the same color A and
the same color B are compared, the result depends on which
quantity is compared. If the motion factor were determined from
each pixel by itself, the values of the motion factor would
follow a sine wave along the line. For this reason, four pixels
are used as a group to determine the value of the motion
factor.
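
The red-versus-white example can be worked through numerically
as a rough check. The sketch below assumes RGB values
normalized to 0..1, pure red as (1, 0, 0), and white as
(1, 1, 1); these concrete colors and all names are
illustrative, while the conversion coefficients come from
paragraph [0006].

```python
def rgb_to_yuv(r, g, b):
    """RGB to (Y, U, V) using the coefficients of paragraph [0006]."""
    y = 0.299 * r + 0.587 * g + 0.114 * b
    return y, 0.493 * (b - y), 0.877 * (r - y)


def phase_samples(y, u, v):
    """The four quantities seen at sampling phases 0, 0.5*pi, pi, 1.5*pi."""
    return {"Y+V": y + v, "Y+U": y + u, "Y-V": y - v, "Y-U": y - u}


white = phase_samples(*rgb_to_yuv(1.0, 1.0, 1.0))
red = phase_samples(*rgb_to_yuv(1.0, 0.0, 0.0))

# Difference seen at each sampling phase when white changes to red.
diffs = {name: abs(white[name] - red[name]) for name in white}
# Averaging over the four phases of one subcarrier cycle.
mean_diff = sum(diffs.values()) / 4.0
```

With these assumed colors the Y+V difference is below 0.1 while
the Y-V difference exceeds 1.3, so the per-pixel value swings
with the sampling phase. The four-phase average is about 0.701
at every position, which is the motivation for grouping four
pixels.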
[0033] For the foregoing reasons, after the maximum difference
MD_{x,y} for each pixel is calculated, the maximum differences
MD_{x,y} of any four adjacent pixels (including the maximum
difference of the target pixel) are averaged. The minimum of
the averaged values is selected to serve as the motion factor.
The calculation formula is, for example, AMD_{x,h} = (MD_{x,h}
+ MD_{x,h+1} + MD_{x,h+2} + MD_{x,h+3}) / 4 and MF_{x,y} =
Min(AMD_{x,y}, AMD_{x,y-1}, AMD_{x,y-2}, AMD_{x,y-3}), in which
AMD_{x,h} represents the averaged maximum difference for the
hth pixel of the xth line, and MF_{x,y} represents the motion
factor for the yth pixel of the xth line. In the embodiment,
only the left averaged maximum difference AMDL_{x,y}, that is,
AMD_{x,y-3}, and the right averaged maximum difference
AMDR_{x,y}, that is, AMD_{x,y}, are taken as the examples, and
AMDL_{x,y} (step S303) and AMDR_{x,y} (step S304) are
respectively calculated. In step S305, the minimum of
AMDL_{x,y} and AMDR_{x,y} is taken as the motion factor by
MF_{x,y} = Min(AMDL_{x,y}, AMDR_{x,y}). The reason for taking
the minimum is that the motion factor of the pixel P_{x,y} is
affected by about 7 adjacent pixels. If the pixel P_{x,y} is in
the still state, it should not be affected by the adjacent
pixels in a way that causes it to be misjudged as being in the
motion state. Therefore, it is proper to take the minimum of
the averaged maximum differences as the motion factor.
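
Steps S303 to S305 can be sketched as follows for one line of
maximum differences. The sketch uses zero-based list indices
and rejects pixels near the line ends, since the text does not
say how borders are handled; both choices, and the function
names, are assumptions.

```python
def averaged_max_differences(md):
    """AMD[h] = (MD[h] + MD[h+1] + MD[h+2] + MD[h+3]) / 4 for every full window."""
    return [sum(md[h:h + 4]) / 4.0 for h in range(len(md) - 3)]


def motion_factor(md, y):
    """MF[y] = Min(AMDL, AMDR), with AMDL = AMD[y-3] and AMDR = AMD[y]."""
    if y < 3 or y > len(md) - 4:
        raise ValueError("pixel needs three neighbors on each side")
    amd = averaged_max_differences(md)
    amdl = amd[y - 3]   # left window: pixels y-3 .. y   (step S303)
    amdr = amd[y]       # right window: pixels y .. y+3  (step S304)
    return min(amdl, amdr)                              # step S305


# A still region (MD = 0) next to a moving region (MD = 8).
md_line = [0, 0, 0, 0, 8, 8, 8, 8]
```

In this example pixel 3, the last still pixel, keeps a motion
factor of 0 because its left window lies entirely in the still
region, even though its right window averages 6. Taking the
minimum therefore keeps the moving neighbors from pulling a
still pixel into the motion state.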
[0034] Before judging whether the composite video signal is in the
motion state or the still state, a threshold is provided (step
S306). In step S307, the motion factor is compared with the
threshold. In step S308, if the motion factor is significantly
greater than the threshold, a motion state is determined. In
step S309, if the motion factor is significantly less than the
threshold, a still state is determined.
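
Steps S302 to S309 can be put together as one sketch that
classifies the pixels of a single line. It assumes the plain
comparison of paragraph [0021] (motion when the motion factor
is greater than the threshold, still otherwise), zero-based
indices, and that pixels without a full window on both sides
are simply skipped. The function name and return format are
hypothetical.

```python
def detect_motion_line(line_next, line_cur, line_prev, line_prev2, threshold):
    """Classify the pixels of one line of frame m as "motion" or "still".

    The four arguments are the same line taken from frames m+1, m, m-1 and m-2.
    Only pixels with a full window on both sides (y-3 .. y+3) are classified.
    """
    # Step S302: per-pixel maximum difference between same-phase frames.
    md = [max(abs(c - p2), abs(n - p1))
          for n, c, p1, p2 in zip(line_next, line_cur, line_prev, line_prev2)]
    # Steps S303/S304: four-pixel averages; amd[h] covers pixels h .. h+3.
    amd = [sum(md[h:h + 4]) / 4.0 for h in range(len(md) - 3)]
    states = {}
    for y in range(3, len(md) - 3):
        mf = min(amd[y - 3], amd[y])                          # step S305
        states[y] = "motion" if mf > threshold else "still"  # steps S307 to S309
    return states


still_line = [10] * 10
moving_line = [50] * 10
unchanged = detect_motion_line(still_line, still_line, still_line, still_line, 5)
changed = detect_motion_line(moving_line, still_line, still_line, still_line, 5)
```

Here `unchanged` marks every classified pixel as still, and
`changed` marks every classified pixel as motion, because only
frame m+1 differs from the others.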
[0035] In summary, the application of the method of motion
detection for a 3D comb filter video decoder is described with
a block diagram. FIG. 4 is a block diagram schematically
illustrating the application of the method of motion detection
for a 3D comb filter video decoder under an NTSC system,
according to an embodiment of the invention. In FIG. 4, the 3D
comb filter video decoder applying the motion detection of the
invention includes an inter-frame Y/C separator 410, an
intra-field Y/C separator 420 (that is, a 2D comb filter), a
motion detector 430, a memory 440, and a mixer 450. The motion
detector 430 includes the features of the invention. The
composite video signal 401 is obtained by sampling, wherein
F_{m+1} represents the composite video signal 401 for the
(m+1)th frame. The memory 440 stores the composite video signal
401 and provides the composite video signal 402 for the mth
frame F_{m}, the composite video signal 403 for the (m-1)th
frame F_{m-1}, and the composite video signal 404 for the
(m-2)th frame F_{m-2}. The memory 440 also separately provides
a composite video signal 405 for the mth frame F_{m}. The
intra-field Y/C separator 420 receives the composite video
signal 405, uses the spatial relation between the pixels in the
frame F_{m} to perform the Y/C separation, and then exports the
separated video signal 421.
[0036] The luma and chroma of a motion video signal are separated
by the intra-field Y/C separator 420. In order to improve the
image quality, the still video signal is processed by the
inter-frame Y/C separator 410. The inter-frame Y/C separator
410 simultaneously receives the sampling data of the composite
video signal for the frames F_{m+1}, F_{m}, F_{m-1}, and
F_{m-2}, uses the temporal relation between the corresponding
pixels in the adjacent frames to separate luma and chroma, and
then exports the separated video signal 411. The function of
judging whether the composite video signal 401 is in the motion
state or the still state is performed by the motion detector
430. The motion detector 430 receives the sampling data for the
frames F_{m+1}, F_{m}, F_{m-1}, and F_{m-2} in the composite
video signal, judges whether the pixel is in the motion state
or the still state, and then exports the selection signal 431.
The judging method is the same as in the foregoing description
and is not described again. The mixer 450, according to the
selection signal 431, selects the separated video signal 421,
the separated video signal 411, or a mix of these two signals
at a predefined ratio, so as to export the separated video
signal 451.
[0037] It will be apparent to those skilled in the art that various
modifications and variations can be made to the structure of
the present invention without departing from the scope or
spirit of the invention. In view of the foregoing, it is
intended that the present invention covers modifications and
variations of this invention provided they fall within the
scope of the following claims and their equivalents.



